Dissecting a Hidden Gene Duplication: The Arabidopsis thaliana SEC10 Locus

Authors

  • Nemanja Vukašinović
  • Fatima Cvrčková
  • Marek Eliáš
  • Rex Cole
  • John E. Fowler
  • Viktor Žárský
  • Lukáš Synek
Abstract

Repetitive sequences present a challenge for genome sequence assembly, and highly similar segmental duplications may disappear from assembled genome sequences. Having found a surprising lack of observable phenotypic deviations and non-Mendelian segregation in Arabidopsis thaliana mutants in SEC10, a gene encoding a core subunit of the exocyst tethering complex, we examined whether this could be explained by a hidden gene duplication. Re-sequencing and manual assembly of the Arabidopsis thaliana SEC10 (At5g12370) locus revealed that this locus, comprising a single gene in the reference genome assembly, indeed contains two paralogous genes in tandem, SEC10a and SEC10b, and that a sequence segment of 7 kb in length is missing from the reference genome sequence. Differences between the two paralogs are concentrated in non-coding regions, while the predicted protein sequences exhibit 99% identity, differing only by substitution of five amino acid residues and an indel of four residues. Both SEC10 genes are expressed, although varying transcript levels suggest differential regulation. Homozygous T-DNA insertion mutants in either paralog exhibit a wild-type phenotype, consistent with proposed extensive functional redundancy of the two genes. By these observations we demonstrate that recently duplicated genes may remain hidden even in well-characterized genomes, such as that of A. thaliana. Moreover, we show that the use of the existing A. thaliana reference genome sequence as a guide for sequence assembly of new Arabidopsis accessions or related species has at least in some cases led to error propagation.
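The comparison of the two predicted proteins (substitutions, an indel, and an overall percent identity) can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned and that gaps are written as `-`. The function name `compare_aligned` and the toy sequences are hypothetical and are not the SEC10a/SEC10b sequences. Percent identity is computed here over all alignment columns including gap columns, which is only one of several conventions in use.

```python
def compare_aligned(seq_a: str, seq_b: str, gap: str = "-") -> dict:
    """Count identical, substituted and gapped columns in a pairwise alignment."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")

    identical = substitutions = gap_columns = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue  # a column that is a gap in both sequences carries no information
        if a == gap or b == gap:
            gap_columns += 1  # one residue of an indel
        elif a == b:
            identical += 1
        else:
            substitutions += 1

    columns = identical + substitutions + gap_columns
    percent_identity = 100.0 * identical / columns if columns else 0.0
    return {
        "identical": identical,
        "substitutions": substitutions,
        "gap_columns": gap_columns,
        "percent_identity": percent_identity,
    }


# Toy aligned sequences (made up for illustration): one substitution, one 4-residue indel.
toy_a = "MKTLLVAGSDE----QW"
toy_b = "MKTALVAGSDEPRSTQW"
result = compare_aligned(toy_a, toy_b)
print(result)
```

On real data the aligned input would come from a pairwise alignment tool. The counting step stays the same, but the reported identity depends on whether gap columns are included in the denominator.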

Similar resources

Differential Expression of Arabidopsis thaliana Acid Phosphatases in Response to Abiotic Stresses

The objective of this research is to identify Arabidopsis thaliana genes encoding acid phosphatases induced by phosphate starvation. Multiple alignments of eukaryotic acid phosphatase amino acid sequences led to the classification of these proteins into four groups, including purple acid phosphatases (PAPs). Specific degenerate primers were designed based on conserved sequences of PAPs isol...

Negative control of Strictosidine synthase like-7 gene on salt stress resistance in Arabidopsis thaliana

Strictosidine synthase-like (SSL) genes form a gene family in the Arabidopsis genome whose orthologues in other plants encode key enzymes in the monoterpenoid indole alkaloid biosynthesis pathway. SSL7 is upregulated upon treatment of Arabidopsis plants with signaling molecules such as salicylic acid (SA), methyl jasmonate and ethylene. To find the functional role of the gene, a T-DNA-mediated knockout...

The hidden duplication past of Arabidopsis thaliana.

Analysis of the genome sequence of Arabidopsis thaliana shows that this genome, like that of many other eukaryotic organisms, has undergone large-scale gene duplications or even duplications of the entire genome. However, the high frequency of gene loss after duplication events reduces colinearity and therefore the chance of finding duplicated regions that, at the extreme, no longer share homol...

Gene transcriptomic profile in Arabidopsis thaliana mediated by radiation-induced bystander effects

Background: In vivo radiation-induced bystander effects (RIBE) at the developmental, genetic, and epigenetic levels have been well demonstrated using the model plant Arabidopsis thaliana (A. thaliana). However, the mechanisms underlying RIBE in plants are not clear; in particular, comprehensive knowledge of the genes and biological pathways involved in RIBE in plants is lacking. Materials and...

Natural diversity in flowering responses of Arabidopsis thaliana caused by variation in a tandem gene array.

Tandemly arrayed genes that belong to gene families characterize the genomes of many organisms. Gene duplication and subsequent relaxation of selection can lead to the establishment of paralogous cluster members that may evolve along different trajectories. Here, we report on the structural variation in the MADS AFFECTING FLOWERING 2 (MAF2) gene, one member of the tandemly duplicated cluster of MADS-bo...

Volume: 9

Publication date: 2014